skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "McAfee, Bennett J"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract. Water quality in lakes is an emergent property of complex biotic and abiotic processes that differ across spatial and temporal scales. Water quality is also a determinant of ecosystem services that lakes provide and is thus of great interest to ecologists. Machine learning and other computer science techniques are increasingly being used to predict water quality dynamics as well as to gain a greater understanding of water quality patterns and controls. To benefit the sciences of both ecology and computer science, we have created a benchmark dataset of lake water quality time series and vertical profiles. LakeBeD-US contains over 500 million unique observations of lake water quality collected by multiple long-term monitoring programs across 17 water quality variables from 21 lakes in the United States. There are two published versions of LakeBeD-US: the “Ecology Edition” published in the Environmental Data Initiative repository (https://doi.org/10.6073/pasta/c56a204a65483790f6277de4896d7140, McAfee et al., 2024) and the “Computer Science Edition” published in the Hugging Face repository (https://doi.org/10.57967/hf/3771, Pradhan et al., 2024). Each edition is formatted in a manner conducive to inquiries and analyses specific to each domain. For ecologists, LakeBeD-US: Ecology Edition provides an opportunity to study the spatial and temporal dynamics of several lakes with varying water quality, ecosystem, and landscape characteristics. For computer scientists, LakeBeD-US: Computer Science Edition acts as a benchmark dataset that enables the advancement of machine learning for water quality prediction. 
    more » « less
    Free, publicly-accessible full text available January 1, 2026
  2. LakeBeD-US: Ecology Edition is a harmonized lake water quality dataset containing time series and vertical profiles of 21 lakes in the United States monitored by long-term monitoring institutions. These institutions include the North Temperate Lakes Long-Term Ecological Research program (NTL-LTER), Niwot Ridge Long-Term Ecological Research program (NWT-LTER), National Ecological Observatory Network (NEON), and the Carey Lab at Virginia Tech as part of the Virginia Reservoirs Long-Term Research in Environmental Biology (LTREB) site in collaboration with the Western Virginia Water Authority. The data include depth-discrete observations of 17 water quality variables including temperature, dissolved oxygen, chemical properties, Secchi depth, and more. Observations are divided into data collected by automated sensors at a relatively high temporal frequency and manually sampled data at a relatively low temporal frequency. All data were collected in situ. The data are available as Apache Parquet files, and the included R scripts give guidance on how to utilize and query the dataset in R. LakeBeD-US: Ecology Edition is an ecological science-oriented companion to LakeBeD-US: Computer Science Edition. The Computer Science Edition is available on the Hugging Face Hub. 
    more » « less